A Bayesian approach to DNA sequence segmentation.

Authors

  • Richard J Boys
  • Daniel A Henderson
Abstract

Many deoxyribonucleic acid (DNA) sequences display compositional heterogeneity in the form of segments of similar structure. This article describes a Bayesian method that identifies such segments by using a Markov chain governed by a hidden Markov model. Markov chain Monte Carlo (MCMC) techniques are employed to compute all posterior quantities of interest and, in particular, allow inferences to be made regarding the number of segment types and the order of Markov dependence in the DNA sequence. The method is applied to the segmentation of the bacteriophage lambda genome, a common benchmark sequence used for the comparison of statistical segmentation algorithms.
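
The hidden-segment idea in the abstract can be illustrated with a small sketch. It is not the authors' method: the transition and emission probabilities are assumed known, bases are treated as independent given the segment type (zeroth-order dependence), and the forward-backward recursion stands in for the MCMC scheme that the article uses to infer the parameters, the number of segment types, and the order of dependence. The names `simulate` and `posterior_segments` and the uniform initial distribution are illustrative assumptions.

```python
import random

BASES = "ACGT"

def simulate(n, trans, emit, seed=0):
    """Simulate hidden segment types and a DNA sequence of length n."""
    rng = random.Random(seed)
    k = len(trans)
    state = rng.randrange(k)
    states, seq = [], []
    for _ in range(n):
        states.append(state)
        # Emit a base from the composition of the current segment type.
        seq.append(rng.choices(BASES, weights=emit[state])[0])
        # Move to the next hidden segment type.
        state = rng.choices(range(k), weights=trans[state])[0]
    return states, "".join(seq)

def posterior_segments(seq, trans, emit):
    """Scaled forward-backward: P(segment type at t | whole sequence)."""
    k, n = len(trans), len(seq)
    obs = [BASES.index(b) for b in seq]
    fwd, scale = [], []
    prev = [1.0 / k] * k  # uniform initial distribution (assumption)
    for t, o in enumerate(obs):
        if t == 0:
            cur = [prev[j] * emit[j][o] for j in range(k)]
        else:
            cur = [sum(prev[i] * trans[i][j] for i in range(k)) * emit[j][o]
                   for j in range(k)]
        c = sum(cur)  # normalise at every position to avoid underflow
        cur = [x / c for x in cur]
        fwd.append(cur)
        scale.append(c)
        prev = cur
    bwd = [[1.0] * k for _ in range(n)]
    for t in range(n - 2, -1, -1):
        o = obs[t + 1]
        raw = [sum(trans[i][j] * emit[j][o] * bwd[t + 1][j] for j in range(k))
               for i in range(k)]
        bwd[t] = [x / scale[t + 1] for x in raw]
    post = []
    for t in range(n):
        w = [fwd[t][j] * bwd[t][j] for j in range(k)]
        s = sum(w)
        post.append([x / s for x in w])
    return post

# Two hypothetical segment types: AT-rich and GC-rich, with long segments.
trans = [[0.99, 0.01], [0.01, 0.99]]
emit = [[0.4, 0.1, 0.1, 0.4], [0.1, 0.4, 0.4, 0.1]]
states, seq = simulate(2000, trans, emit, seed=1)
post = posterior_segments(seq, trans, emit)
```

Each row of `post` gives the probability of every segment type at that position; thresholding it yields a segmentation that can be compared with the simulated `states`.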

Similar resources

Improving the Performance of Bayesian Estimation Methods in Estimations of Shift Point and Comparison with MLE Approach

A Bayesian analysis is used to detect a change-point in a sequence of independent random variables from exponential distributions. In this paper, we try to estimate the change point that occurs in any sequence of independent exponential observations. The Bayes estimators are derived for the change point, the rate of the exponential distribution before the shift and the rate of the exponential distribution after s...

Molecular Identification of the Persian Gulf Sea Hare (Aplysia sp.) Based on 16s rRNA Gene Sequence

Background: Sea hares of the Aplysia genus are among the mollusks of interest for various researchers to study their phylogeny, bioactive compounds and the nervous system. These mollusks are herbivorous and produce chemical compounds (ink) to defend themselves. The present study provided molecular identification of the Persian Gulf (Bushehr city) sea hare using 16s rRNA gene sequence. Materials...

Pii: S0378-1119(01)00672-2

The concept of homogeneity of G + C content is always relative and subjective. This point is emphasized and quantified in this paper using a simple example of one sequence segmented into two subsequences. Whether the sequence is homogeneous or not can be answered by whether the two-subsequence model describes the DNA sequence better than the one-sequence model. There are at least three equivale...

Bayesian hidden Markov model for DNA sequence segmentation: A prior sensitivity analysis

The focus of this paper is on the sensitivity to the specification of the prior in a hidden Markov model describing homogeneous segments of DNA sequences. An intron from the chimpanzee α-fetoprotein gene, which plays an important role in embryonic development in mammals, is analysed. Three main aims are considered: (i) to assess the sensitivity to prior specification in Bayesian hidden Markov m...

A Bayesian approach to discriminate between alternative DNA sequence segmentations

MOTIVATION: As a result of recombination or rate variation, a DNA sequence alignment may have a mosaic structure, where different segments correspond to different evolutionary histories. While several methods have been developed to predict DNA mosaic structures, they do not properly address the question of whether the predicted segmentation itself is statistically significant, or whether it is s...

Journal title:
  • Biometrics

Volume 60, Issue 3

Pages: -

Publication date: 2004